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METHOD FOR ENCODING AND DECODING 
MOVING PICTURE SIGNALS 

BACKGROUND OF THE INVENTION 

This application is a continuation of Reissue Application 
Serial No. 09/925.423. filed August 10, 200 1. 

(1) Field of the Invention 

The present invention relates to a method for encoding 
and decoding digital moving picture signals for use in TV 
phones, TV conferences and the like. 

(2) Description of the Prior Art 

In a general method for encoding digital moving picture 
signals, a frame of inputted moving picture is divided into 
plural blocks each composed of NxM pixels, and processes 
of motion detection, prediction, orthogonal transform, 
quantization, variable length coding, etc. are conducted on 
each block. 

In a general method for decoding digital motion picture 
signals, blocks each composed of NxM pixels are regener- 
ated in a reverse [procdyre] procedure , that is, processes of 
variable length decoding, reverse quantization, reverse 
orthogonal transform, motion compensation, etc. 

The above general encoding method and decoding 
method for encoding and decoding digital moving picture 
signals enable removal of redundancy contained in moving 
picture signals, and efficient communication and storage of 
a moving picture with less information. 

In the general encoding method and decoding method for 
encoding and decoding digital moving picture signals, the 
processes are conducted on each pixel block, as stated 
above. It is general that a set of pixel blocks forms a 
subframe and a set of subframes forms a frame, which are 
units processed in the general encoding and decoding 
method. 

Hereinafter, encoding and decoding of each block, sub- 
frame and frame will be described by way of an example of 
a general encoding and decoding method for encoding and 
decoding digital moving picture signals with reference to 
ITU-T Recommendation H.261 (hereinafter, referred simply 
H.261) made on March, 1993. 

H.261 defines an encoding method and a decoding 
method for encoding and decoding luminance signals and 
color difference signals, separately, of digital moving picture 
signals. However, description will be made of only the 
luminance signals, for the sake of convenience. Basically, 
the encoding method and decoding method for encoding and 
decoding the luminance signals are not different from those 
for the color difference signals. 

As shown in FIG. 1, one frame 101 of digital moving 
picture signals is composed of 352x288 pixels according to 
H.261. The frame 101 is divided into twelve subframes 102 
called GOBs (Group of Blocks) each composed of 1 76x48 
pixels (hereinafter, the subframe in the description of the 
prior art will be referred a GOB). Further, the GOB 102 
(subframe) is divided into thirty three blocks 103 called 
macro blocks each composed of 16x16 pixels. 

The encoding method according to H.261 defines that 
encoded information for one frame is corresponded to a 
spatial hierarchical structure such as the frame 101, GOBs 
102 and macro blocks 103 described above, as shown in 
FIG. 2. 

In FIG. 2, a part enclosed in a rectangle shows encoded 
information, and the number of coding bits is shown under 
each of the rectangles. In FIG. 2, arrows show linkages of 
the encoded information. A series of encoded moving pic- 
ture signal sequences as this is called a bit stream 104. 

In the bit stream 104 according to H.261 shown in FIG. 
2, a part including all encoded information for one macro 
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block 103 is called a macro block layer 103S, a part 
including all encoded information for one GOB 102 is called 
a GOB layer 102S, and a part including all encoded infor- 
mation for one frame 101 is called a frame layer 101 S. 
5 Meanings of the encoded information in each of the layers 
shown in FIG. 2 are given below: 
Frame Layer 101 S 

PSC (20 bits): a frame identifier 105; a unique code by 
which an encoding method can be always identified, 
io expressed as "0000 0000 0000 0000 000 1 

TR (5 bits): a frame number 106; indicating a time 
position in which this frame 101 should be displayed; 

PTYPE (6 bits): frame type information 107; various 
information about the frame 101 ; 
15 PEI (1 bit): extension data insertion information 108; a 
flag representing presence of following PSPARE 109; 

PSPARE (8 bits): extension data; GOB layer 102S 
(subframe) 

GBSC (16 bits): a GOB identifier 110; a unique code by 
20 which a decoding method can be always identified, 
expressed as "0000 0000 0000 0000"; 

GN (4 bits): a GOB number 111; indicating a spatial 
position of this GOB 102 within the frame 101; 

GQUANT (5 bits): quantization characteristic informa- 
25 tion 112; indicating a quantization characteristic when a 
macro block 103 in the GOB 102 is encoded; 

GEI (1 bit): extension data insertion information 113; a 
flag representing presence of following GSPARE 1 14; 
GSPARE (8 bits): extension data 1 14. 
30 Incidentally, the encoded information 115 of the macro 
block layer which is the lowest hierarchy in FIG. 2 is 
generated in the encoding method of motion detection, 
prediction, orthogonal transform, quantization, variable 
length coding, etc., as described before, whose coding bit 
35 number is not fixed. The number of coding bits of the macro 
block layer 103S, in general, increases if a spatial level of 
pixels included in the macro block 103 changes largely or a 
time level of pixels included in the macro block 103 having 
the same spatial positions changes largely. Such macro block 
40 103 is, hereinafter, referred a macro block 103 which is 
difficult to be encoded. 

To the contrary, if a level of pixels included in the macro 
block 103 is steady in relation to space and time, the number 
of coding bits of the macro block layer 103S remarkably 
45 decreases, or sometimes becomes zero. Such macro block 
103 is hereinafeter referred a macro block 103 which is easy 
to be encoded. 

In the decoding method according to H.261, the PSC 105 
which is an identifier of the frame layer 101 S is first found 

so out from the bit stream 104. Incidentally, in a state where a 
decodable code has been successfully found out it is said that 
synchronization is established. When the PSC 105 is found 
out from the bit stream and synchronization of the frame 
layer 101 S is established, it can be identified that the bit 

55 stream 104 until the next PSC 105 appears is encoded 
information for one frame. Further, a time position in which 
the frame 101 composed of 352x288 pixels obtained by 
decoding the bit stream 104 for that one frame can be 
obtained by examining the frame number 1 06 following the 

6o PSC 105. 

After the establishment of the frame layer, a GBSC 110 
that is an identifier of the GOB layer 102S is found out from 
the following bit stream 104 in the encoding method accor- 
ding to H.261. When synchronization of the GBSC layer is 
65 established, it can be identified that the bit stream 104 until 
the next GBSC 110 appears is encoded information for one 
GOB 102. Further, a spatial position of the GOB 102 
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composed of 176x48 pixels obtained by decoding the bit 
stream 104 for that one GOB 102 in a frame 101, in which 
the GOB 102 should be placed, can be obtained by exam- 
ining a GN 111 which is a GOB number following the 
GBSC 110. 

In the decoding method according to H.261, a bit stream 

104 of a following macro block layer 103S is decoded after 
the establishment of the GOB layer 102s. The decoding 
method of the macro block layer 103S is a procedure to 
regenerate a macro block 103 composed of 16x16 pixels in 
processes of variable length decoding, reverse quantization, 
reverse orthogonal transform, motion compensation, etc., as 
described before. It should be here noted that the macro 
block layer 103S has no unique code by which a decoding 
method can be always identified dissimilarly to the PSC 105 
or BGSC 1 10, and encoded information of each macro block 
is composed of undefined length bits of a variable length 
code. 

As shown in FIG. 3, in the GOB (subframe) layer 102S, 
the encoded information from the first macro block 115, to 
the thirty third macro block 1 15 3 3 is expressed as a series of 
variable length codes without a unique code. If decoding of 
the macro block encoded information is initiated from a 
point indicated by A in FIG. 3, and successively conducted 
in the order of the first, the second.... the nth.... the thirty 
third macro blocks, it is possible to regenerate all the macro 
blocks 103 in the GOB layer 102S. However, if the decoding 
of the macro block encoded information is initiated from a 
point indicated by B or C in FIG. 3, it is impossible to 
identify a point from which encoded information 1 15 of one 
macro block starts, which leads to a failure of establishing 
synchronization. In which case, the decoding and regener- 
ating all macro blocks 103 become unfeasible until the next 
GBSC 110 appears. In other words, the GBSC 110 also 
represents a starting point of decoding the macro block layer 
103S. 

Finally, in the decoding method according to H.261, the 
GOB 102 which is a set of regenerated macro blocks 103 is 
placed in a spatial position within a frame 101 directed by 
GN 1 1 1, and the frame 100 which is a set of the regenerated 
GOBs 102 is placed in a time position directed by TR 106. 

As above, it is possible to decode one frame 101 of digital 
moving picture correctly in relation to space and time 
according to H.261. 

However, the above general method for encoding and 
decoding digital moving picture signals has a drawback that 
if a part of a bit stream 104 [lacks] is lacking or an error 
occurs therein, it might be impossible to accurately decode 
all subframes (GOBs) 102 in relation to time until 
synchronization of the next frame layer 101 S is established. 

The reason of the above is that codes which can be 
identified at all times in the bit stream 104 are only the PSC 

105 which is a frame identifier and the GBSC 1 10 which is 
a subframe identifier in the general decoding method. If a 
part of the bit stream 104 lacks or an error occurs therein, it 
is impossible to recover synchronization of the decoding 
until the next GBSC 110 appears so that the decoding 
becomes unfeasible. Even if the next GBSC 1 10 appears, the 
bit stream 104 of that subframe layer 102S cannot be 
correctly decoded in relation to time. This will be under- 
stood from FIG. 4. 

FIG. 4 shows an example where the fifth GOB 102 5 in the 
nth frame lOln through the sixth GOB 102 6 in the (n+I)th 
frame [101 n -i] lOln+j cannot be decoded in relation to time 
due to [lacks] lacking portions or errors of the bit stream 

104 occurring in burst. In this example, not only the PSC 

105 corresponding to the (n+l)th frame in relation to time 
but also the following TR 



106 are missed or in error. It is therefore possible to 
correctly decode the GOB 1027 in relation to space by 
establishing synchronization from the GBSC 110 
corresponding to the seventh GOB 102 7 in the (n+l)th frame 
101 n+ i in relation to time and decoding the following GN 
5 111, but impossible to specify whether this GOB 102 7 
positions in the nth frame or in the (n+l)th frame in relation 
to time. 

In terms of decoding of the eighth GOB 102 8 through the 
twelfth GOB 102 12 in the (n+l)th frame in relation to time, 
it is impossible to specify whether these GOBs 102 position 
in the nth frame or in the (n+l)th frame in relation to time. 

In consequence, if a part of the bit stream 104 is [missed] 
missing or an error occurs therein, it becomes impossible to 
correctly decode all GOBs 102 in relation to time until 
synchronization of the next frame layer 101 5 is established. 

15 Further, the general method for encoding and decoding 
digital moving picture signals has another drawback that if 
the GOB 102 including a picture in motion in relation to 
time cannot be decoded, a picture quality of the reproduced 
picture is largely degraded. 

20 This problem will be described in more detail with 
reference to FIG. 5. FIG. 5 shows one frame including 
decoded signals of a moving picture, where a figure is 
moving in the center of the frame. In FIG. 5, a part moving 
in relation to time is indicated by slanting lines, and the 

25 remaining part is a background which is still in relation to 
time. A scene like this is general in TV conferences, TV 
telephones or the like. 

Referring to FIG. 5, considering that any one of the first 
GOB 102] through the fourth GOB I02 4 cannot be decoded. 

30 The first through fourth GOBs 102i through 102 4 include a 
picture still in relation to time. If the second GOB 102 2 
cannot be decoded, for example, a skillful operation is 
conducted to substitute the second GOB 102 2 of the present 
frame 101 with the second GOB 102 2 of the preceding frame 

35 lOl.i in the decoding. With this operation, degradation of a 
picture quality in the second GOB 102 2 of the present frame 
101 may be hardly detected. 

However, it is a problem if decoding of the fifth through 
twelfth GOBs 102 5 through 102i 2 shown in FIG. 5 cannot 

40 be decoded. The fifth through twelfth [GOSs] GOBs 102 5 
through 102i2 include a picture moving in relation to time. 
This means, for example, that a picture in the ninth GOB 
102 9 of the preceding frame 1 0.1 -1 is largely different from 
the ninth GOB 102 9 of the present frame 101 in relation to 

43 time. If the decoding of the ninth GOB 102 9 is unfeasible, 
degradation of the picture quality of the ninth GOB 102 9 of 
the present frame 101 is obviously detected even if the 
skillful operation mentioned above is conducted in the 
decoding. 

50 Accordingly, if decoding of GOB 102 including a picture 
moving in relation to time becomes unfeasible, a quality of a 
reproduced picture is largely degraded. 

SUMMARY OF THE INVENTION 

55 In the light of the above problems, an object of the present 
invention is to provide a method for encoding and decoding 
digital moving picture signals, which can appropriately 
decode subframes (GOBs) following a subframe in trouble 
in relation to time if a part of a bit stream is missing or an 

60 error occurs in the bit stream. 

Another object of the present invention is to provide a 
method for encoding and decoding digital moving picture 
signals, which can suppress degradation of a reproduced 
picture to a small extent if decoding of a subframe (GOB) 

65 including a picture in motion in relation to time becomes 
unfeasible. 

To accomplish the first object, the present invention is 
featured in that in the method for encoding and decoding 
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digital moving picture signals of this invention, time posi- 
tion information representing an order of displaying a sub- 
frame is attached to an identifier of the subframe by which 
the subframe is identified. 

According to the method for encoding and decoding 
digital moving picture signals of this invention, time posi- 
tion information representing an order of displaying a sub- 
frame is attached to an identifier used to identify the sub- 
frame and the identifier of the subframe is encoded. It is 
therefore possible to decode subframes following a sub- 
frame in trouble appropriately in relation to time if a part of 
[bit stream] a bit stream is missing or an error occurs in the 
bit stream by using the time position information 
representing an order of displaying each of the subframes 
attached to an identifier used to identify the subframe 

To accomplish the second object, the present invention is 
featured in that in the method for encoding and decoding 
digital moving picture signals of this invention, the number 
of blocks included in a subframe is varied according to a 
sum of quantities of generated information of the blocks 
included in the subframe so that each of all the subframes 
included in the frame has an equal sum of quantities of the 
generated information of the blocks included in the sub- 
frame. 

According to the method for encoding and decoding 
digital moving picture signals of this invention, the number 
of blocks included in a subframe is varied according to a 
sum of quantities of generated information of the blocks 
included in the subframe so that each of all the subframes 
included in the frame has an equal sum of quantities of the 
generated information of the blocks included in the sub- 
frame. In consequence, a spatial size of each subframe is not 
fixed. A subframe including a block having a large number 
of coding bits is in a smaller size, whereas a subframe 
including a block having a small number of coding bits is in 
a larger size. It is therefore possible to suppress degradation 
of a reproduced picture even if decoding of a subframe 
becomes unfeasible since a subframe including a block 
which includes a motion in relation to time and is difficult to 
be encoded is in a smaller size in relation to space. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows units to be encoded in a general encoding 
method for encoding moving picture signals; 

FIG. 2 shows a bit stream generated in the general 
encoding method for encoding moving picture signals; 

FIG. 3 shows a GOB layer in the bit stream in FIG. 2 
generated in the general encoding method for encoding 
moving picture signals; 

FIG. 4 illustrates an effect of a lack or an error of a part 
of a bit stream occurring in the general encoding and 
decoding method for encoding and decoding moving picture 
signals; 

FIG. 5 illustrates an effect of a lack or an error of a part 
of a bit stream occurring in the general encoding and 
decoding method for encoding and decoding moving picture 
signals; 

FIG. 6 shows a bit stream generated in a method for 
encoding digital moving picture signals according to first 
and second embodiments of this invention; 

FIG. 7 is a flowchart illustrating the method for decoding 
digital moving picture signals according to the first embodi- 
ment of this invention; 

FIG. 8 illustrates the method for encoding digital moving 
picture signals according to the second embodiment of this 
invention; and 



FIG. 9 shows a structure of subframes according to the 
second embodiment of this invention. 

DESCRIPTION OF THE PREFERRED 
, EMBODIMENTS 

Hereinafter, description will be made of embodiments 
according to the present invention referring to the drawings. 

A method for encoding and decoding digital moving 
picture signals according to a first embodiment will be now 
10 described, which may correctly decode a subframe as a unit 
in relation to time even if a part of a bit stream is missing or 
an error occurs therein. 

In the encoding method according to this embodiment, 
one frame of digital motion picture signals is composed of, 
15 for example, 352x288 pixels. The frame is divided into 
twelve subframes each composed of, for example, 176x48 
pixels. Further, the subframe is divided into thirty three 
blocks 13 each composed of, for example, 16x16 pixels. 
20 The encoding method according to this embodiment cor- 
responds encoded information for one frame to a spatial 
hierarchical structure made up of a frame 11, subframes 12 
and blocks 13 to generate a bit stream 14 as shown, for 
example, in FIG. 6. 
25 Meanings of encoded information of each layer shown in 
FIG. 6 are given below: Frame layer 1 IS 

FSC (20 bits): a frame identifier 15; a unique code by 
which a decoding method can be always identified, 
expressed as "0000 0000 0000 0001 0000"; 
30 Subframe Layer 12S 

SFSC (16 bits): a subframe identifier 16; a unique code by 
which a decoding method can be always identified, 
expressed as "0000 0000 0000 0001 "; 

SFNT (5 bits): a subframe time number 17; indicating a 
35 time position in which this subframe 1 2 should be displayed; 

SFNS (4 bits): a subframe space number 18; indicating a 
spatial position in which the subframe 12 should be dis- 
played; 

40 SFQUANT (5 bits): quantization characteristic informa- 
tion 19; representing a quantization characteristic when a 
block 13 in the subframe 12 is encoded. 

Incidentally, encoded information 20 in the block layer 
13 S which is the lowest hierarchy in FIG. 6 is generated in 

45 an encoding method of motion detection, prediction, 
orthogonal transform, quantization, variable length coding, 
etc., whose coding bit number are not fixed. 

Now referring to FIG. 7, a decoding method according to 
this embodiment will be now described. First, an FSC 15 

so which is an identifier of a frame layer 1 IS is found out from 
a bit stream 14 to establish synchronization of the frame 
layer 1 IS. 

After the establishment of synchronization of the frame 
layer 1 IS, an SFSC 16 which is an identifier of a subframe 

55 layer 12S is found out from the following bit stream 14 to 
establish synchronization of the subframe layer 12S. Then a 
subframe time number SFNT 17 and a subframe space 
number SFNS 18 following the SFSC 16 are examined. 
Next, a bit stream 14 of a block layer 13S is decoded. A 

60 method for decoding this block layer 13S is a procedure to 
regenerate the block in processes of, for example, variable 
length decoding, reverse quantization, reverse orthogonal 
transform, motion compensation, etc. Finally, the subframe 
12 which is a set of the regenerated blocks 13 is placed in 

6 5 time and space positions instructed by the SFNT 17 and the 
SFNS 18. If synchronization of the decoding is lost due to a 
lack of a part of the bit stream 14 or an error therein, a seek 
for the SFSC 16 which is an identifier of the subframe layer 



